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Summary 

•  In  Primary  Auditory  Cortex  (AI)  of  ferrets,  we  characterize  the 
spectro-temporal  properties  of  cells’  responses.  We  find  that  the 
responses  correspond  to  temporal  modulations  from  4  to  40  Hz, 
and  spectral  modulations  from  0  to  2  cycles/octave,  in  the 
stimulus’  spectro-temporal  envelope. 

•  The  Spectro-Temporal  Response  Function  (STRF)  is  the  linear 
component  of  the  response.  It  is  an  excellent  predictor  of  response. 

•  Different  methods  of  determining  the  STRF  are  in  good  agreement, 
make  similar  (and  similarly  accurate)  predictions. 

•  Spike-Triggered  Averaging  is  an  effective  method  to  measure 
the  STRF,  when  used  with  Temporally  Orthogonal  Ripple 
Combinations  (TORCs)  as  stimuli. 

•  Spike-Triggered  Averaging  methods  do  not  depend  on  quadrant 
separability,  and  provide  a  good  method  for  seeking  non-quadrant 
separable  responses. 
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Auditory  Stages  and  Representations 


Cochlea 


•  Sound  is  bandpassed, 
half-wave  rectified,  and 
low-passed 


Inferior  Colliculus  Cortex 


envelope 
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O  envelope  of  envelope 
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•  Envelope  of  the  sound  is  bandpassed  or  •  Band-passed  envelope  from  IC  is  low- 

lowpassed,  extracting  the  first  (fast)  modu-  passed,  extracting  the  second  (slow) 
lation  of  the  sound  near  BF  modulation  of  the  sound  near  BF 


Each  stage  of  the  auditory  pathway  represents  the  auditory  stimulus  differently.  After 
cochlear  processing,  the  acoustic  waveform  is  represented  by  parallel,  frequency- 
ordered,  neural  signals,  which  can  encode  different  characteristics  of  the  sound  in 
different  areas.  By  primary  auditory  cortex,  responses  represent  the  slow  modulations  (a 
few  to  a  few  tens  of  Hertz)  of  the  envelope  of  the  signal. 

Spike-triggered  averaging,  a  method  for  extracting  the  linear  component  of  the  response, 
can  be  done  on  neural  signals  at  any  stage,  and  the  averaging  can  be  on  any 
representation  of  the  signal.  The  stimuli  must  be  “rich”  for  the  final  average  to  fully 
characterize  the  response. 


Acoustic  Waveform  Fast  Spectro-Temporal  Envelope  Slow  Spectro-Temporal  Envelope  Spikes  to  trigger  average 
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Introduction  to  Cortical  Responses  I 

Spike-triggered 
waveform 

0 

Inability  of  cortical  cells  to 
lock  to  waveform  usually 
renders  this  statistic  useless 

0  20  40  60  80  100 


Spike-triggered 
average  envelope 

Cortical  cells  lock  to  the  slow(er) 
modulations  of  the  envelope. 
The  average  does  not  give  much 
frequency  selectivity  information 


Spike-triggered 

average 

spectrogram 

Bandpassing  at  various 
frequencies  before  computing 
the  spike-triggered  average  of 
the  envelope  gives  more  detail, 
such  as  differentiating  between 
inhibitory  and  excitatory 
frequency  bands,  and  shows 
different  temporal  responses 
for  different  frequency  bands 
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Introduction  to  Cortical  Responses  II 

Spike-triggered 
spectro-temporal 
envelope  average 


Spike-triggered  averaging 
of  the  spectro-temporal 
envelope  directly  gives  a 
similar  spectro-temporal 
response  field  to  the  spike- 
triggered  average  of  the 
filter-bank  envelopes 
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STRF  from  Ripple 
Transfer  Funcion 

The  spectro-temporal  transfer 
function  is  compiled  by 
measuring  the  response 
amplitude  and  phase  to  single 
sinusoidally  modulated  spectra 
that  move  in  time  (ripples). 
The  spectro-temporal  response 
function  is  the  2-D  inverse 
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Motivation  and  Methods 

•  Primary  cortical  cells  prefer  spectral  envelope  modulations  from  4  to  40  Hz,  and  0  to  2 
cycles/octave. 

•  Ripple  transfer  functions,  the  Fourier  transform  of  the  Spectro-Temporal  Response 
Field  (STRF),  are  time-consuming  to  measure.  Assuming  separability  (or  at  least 
quadrant- separability),  measurement  time  is  reduced,  but  how  universal  is  separability? 

•  STRF  can  also  be  measured  by  spike-triggered  averaging  of  spectro-temporal 
representations  of  the  stimulus.  The  stimulus  must  be  “rich”  in  the  spectro-temporal 
modulations  that  characterize  the  response. 

•  Spike-triggered  averaging  does  not  rely  on  separability  and  can  be  used  to  test 
separability  directly. 

•  As  stimuli,  Temporally  Orthogonal  Ripple  Combinations  (TORCs)  cover  large  regions  of 
spectro-temporal  modulation  space  efficiently. 

•  Each  stimulus  is  composed  of  superposition  of  moving  sinusoids  (ripples) 

•  Stimuli  have  only  one  spectral  modulation  for  each  temporal  modulation,  but  many 
temporal  modulations,  each  a  multiple  of  the  base. 

•  Stimulus  components  are  orthogonal  over  the  averaging  interval. 

•  One-to-one  correspondence  between  stimulus  components  and  spectral  modulations 
removes  ambiguity  of  which  component  evokes  which  aspect  of  the  response  dynamics. 

•  Duplicating  each  stimulus  with  opposite  polarity  (overall  sign)  strongly  reduces  half¬ 
wave  rectification  non-linearity  (actually  all  even-order  non-linearities). 
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Spectro-Temporal  Modulations 


Moving  Ripple  in  Spectro-Temporal 
Space  (Spectrogram) 


Time  (ms) 


The  Fourier  transform  of  a  moving 
sinusoid  has  support  only  on  a  single 
point  (and  its  complex  conjugate). 


Moving  Ripple 
in  Fourier  Space 
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2-D  Decomposition  of  Broadband  Sound 


•  Frequencies  mapped 
along  cochlea  on  log 
frequency  axis 

•  Natural  sounds  dynamic, 
time  axis  required. 

•Use  two-dimensional 
functions  of  log(freq)  and 
time 

•  Analysis  is  often 
conceptually  simpler  in 
the  Fourier  domain. 


(A)  A  speech  fragment 
has  its  envelope  (B) 
Fourier  transformed.  The 
Fourier  transform  is  then 
approximated  by  its  100 
largest  components  in 
(C)  and  then  inverted 
back  in  (D),  giving  an 
excellent  approximation 
to  the  original  envelope. 
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Spectrogram  (log  frequency)  LPC  Envelope 


Ripple  Transform  ( 1 00  peaks)  Reconstruction 
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Spike  Averaged  Spectro-Temporal  Envelope 

Stimulus  No  8  Response  No.  8  (Normalized  PSTH) 
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Spike-Triggered  Avg.  Past  Stimulus  No.  8 
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Cochlear  Spectrogram 


Stimulus  Waveform  Cochlear  Filters 


Spectrogram 


The  cochlear  spectrogram  of  the  stimulus  is  obtained  by  passing  the  stimulus 
waveform  through  a  bank  of  cochlear  filters.  The  temporal  envelope  of  each  of 
the  filter  outputs  form  the  rows  of  the  cochlear  spectrogram. 
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Spike-Triggered  Average  Spectrogram 


Stimulus  No.  8  Response  No.  8  (Normalized  PSTH) 


Center  for  Auditory 
and  Acoustic  Research 


Time  (ms) 


Time  (ms) 

Institute  for  Systems  Research 
University  of  Maryland 


—  Measurements  by  Ripple  Velocity 

Ripple  Velocity  (Hz)  Ripple  Frequency  is  0.4  cyc/oct  70  dB 


which  can  be  inverse  Fourier  transformed 
to  give  Impulse  Responses  in  (D). 
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Measurements  by  Ripple  Frequency 


A 

Ripple  Frequency  (cyc/oct) 


Ripple  Velocity  is  8  Hz 


70  dB 
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Ripple  Frequency  (cyc/oct) 

Spike  events  in  (A)  are  turned  into  period 
histograms  in  (B).  The  amplitudes  and 
phases  give  the  tranfer  function  in  (C), 
which  can  be  inverse  Fourier  transformed 
to  give  Response  Fields  in  (D). 
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Spectro-Temporal  Transfer  Function 

•  Spectro-temporal  response  field  of  neuron  is  the  usual  response  field  made  time-dependent. 

•  Its  Fourier  transform  is  the  transfer  function. 

•  Either  can  be  used  to  predict  the  response  to  any  broadband  dynamic  sound. 


Spectro-Temporal  Response 
Function  (STRF)  of  a  neuron 


2  Dimensional  Transfer 
Function  of  the  same  neuron 


Fourier  Transform 


[.]  exp(±2jE/'Qx±27E/wO 


Inverse  Transform 
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Frequency  (Hz)  Frequency  (Hz) 


Spectro-Temporal  Responses  Compared  I 


Inverse  Fourier  transform  of 


Spike-triggered  average  Spike-triggered  average 

spectro-temporal  envelope  low-passed  spectrogram 
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Frequency  (Hz)  Frequency  (Hz) 


Spectro-Temporal  Responses  Compared  II 


Inverse  Fourier  transform  of  Spike-triggered  average 

ripple  transfer  function  _  spectro-temporal  envelope 
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Spectro-Temporal  Responses  Compared 


4000 


2000 


1000 


500 


250 


125 


Inverse  Fourier  transform  of 
ripple  transfer  function _ 


226/25 a  QSP 


8000 


4000 


2000 


1000  - 


500 


250 


Spike-triggered  average 
spectro-temporal  envelope 


Spike-triggered  average 
low-passed  spectrogram 


100  150 

Time 
(ms) 


200  250 


On 

\J  Ce 


Center  for  Auditory 
and  Acoustic  Research 


i 


i 


Time 

(ms) 


Time 

(ms) 


Institute  for  Systems  Research 
University  of  Maryland 


Linearity  in  Theory 


Assuming  linearity,  the  STRF  predicts  the  response  to  any  broadband  dynamic  stimulus, 
including  single  ripples  moving  in  either  direction  (first  two  rows)  and  combinations  of 
upward  and  downward  moving  ripples. 
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Linearity  in  Practice 


The  correlation  between  predicted  and  actual  response  is  quite  good  for 
most  cells.  Since  cells  cannot  fire  at  negative  rates,  any  prediction  should 
be  half-wave  rectified  before  comparing  to  the  actual  response. 


Stimulus  Spectrogram  STRF  Response 
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Fast  Responses 


Some  units  respond  well  at  time  scales  as  fast  as  ~10  ms.  This  is  seen  both  in  the  raster 
plot  and  in  the  STRF.  When  the  output  of  the  filter  bank  is  low-passed  at  25  Hz,  the 
resulting  STRF  looks  much  more  like  the  Spectro-Temporal  Envelope  generated 
STRF,  which  contains  temporal  modulation  only  up  to  24  Hz  (in  this  case). 
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Raster  Plot 


Spike-triggered 
Cochlear  Filter 
average 


Spike-triggered 
Low-Passed  Cochlear  Filter 

average 


Spike-triggered 
Spectro-Temporal  Envelope 

average 
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Quadrant  Separability 

An  STRF  can  fall  into  one  of  Spectro-Temporal  Domain 

three  categories: 


Fourier  Domain 


•  Non-separable:  The  transfer 
function  is  an  arbitrary 
(complex-conjugate  symmetric) 
function  of  ripple  frequency  and 
ripple  velocity. 


•  Quadrant  separable:  The 

transfer  function  within  each 
quadrant  is  a  product  of  a  func¬ 
tion  of  ripple  frequency  and  a 
function  of  ripple  velocity.  The 
envelope  of  the  STRF  is  a  sim¬ 
ple  product  of  a  function  of 
spectrum  and  a  function  of  time. 


•  Fully  separable:  The  transfer 
function  is  the  product  of  a  func¬ 
tion  of  ripple  freqency  and  rip¬ 
ple  velocity  everywhere.  The 
resulting  STRF  is  a  product  of  a 
function  of  spectrum  and  a  func¬ 
tion  of  time. 
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Separability  in  STRFs 

Examples  of  Experimentally  obtained  STRFs 


Separable 


Non- separable 
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Note  the  variety  of  spectral  and  temporal  behaviors 
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Cortical  Filter  Model 


Response  fields  in  AI  have  charac¬ 
teristic  shapes  both  spectrally  and 
temporally. 


Theoretical  ripple  filters  used  to  generate  a 
‘cortical  representation’ 


•  AI  cells  respond  well  only  to  a  small 
set  of  moving  ripples  around  a  par¬ 
ticular  spectral  peak  spacing  and 
velocity. 

•We  find  cortical  cells  with  all  center 
frequencies,  spectral  symmetries, 
bandwidths,  latencies  and  temporal 
impulse  response  symmetries. 

•  Therefore  AI  decomposes  the  input 
spectrum  into  different  spectrally 
and  temporally  tuned  channels. 

•  Equivalently,  a  population  of  cells, 
tuned  around  different  moving  rip¬ 
ple  parameters,  can  effectively  rep¬ 
resent  the  input  spectrum  at  multiple 
scales. 
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—  The  Cortical  Representation  _ 

Spectrally  narrow  cells  pick  out  the  fine  features  of  the  spectral  profile,  whereas  broadly  tuned 
cells  pick  out  the  coarse  outlines  of  the  spectrum.  Similarly,  dynamically  sluggish  cells  will 
respond  to  the  slow  changes  in  the  spectrum,  whereas  fast  cells  respond  to  rapid  onsets  and 
transitions.  In  this  manner,  AI  is  able  to  encode  multiple  views  of  the  same  dynamic  spectrum. 
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